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Abstract 

Two new high-throughput techniques, computer tomography X-ray den- 
sitometry (CT scan) and pyrolysis molecular beam mass spectrometry 
(pyMBMS), coupled with quantitative trait loci (QTL) analysis, were tested 
as a means to overcome the time and cost associated with conventional char- 
acterization of biomass feedstock components. Applications of these two 
techniques were evaluated using hybrid poplar for the CT scan and loblolly 
pine for the pyMBMS. Segregating progeny from hybrid poplar varied in 
specific gravity, with individual mean estimates ranging from 0.21-0.41. 
Progeny from loblolly pine varied in lignin, a cellulose, and mannan contents, 
with individual mean estimates of lignin content ranging from 28.7-33.1%, 
a cellulose content from 28.8-43.5% and mannan content from 4.2-10.1%. 
QTL analysis of the loblolly pine data suggested that eleven QTLs were 
associated with individual feedstock characteristics and that two QTLs for 
several feedstock components were linked to the same position on the loblolly 
pine genetic map. Each QTL individually accounted for 7-13% of the total 
phenotypic variation in associated loblolly pine feedstock components. 

Index Entries: Pyrolysis molecular beam mass spectrometry; computer 
tomography; genetic markers; QTL; lignin; cellulose; hemicellulose. 
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Introduction 

Feedstock composition affects product value, product quality, and 
production efficiency in all biomass-based industries. For example, in the 
pulp and paper industry, pulp yields are known to increase with increased 
cellulose content, pulp cohesiveness is positively associated with hemicel- 
lulose content, and energy consumption during chemi-thermomechanical 
pulping is inversely related to lignin content (1). In biochemical conversion 
of lignocellulosic feedstock to ethanol fermentation, efficiency is directly 
related to cellulose:hemicellulose ratios, biocatalyst activity can be inhib- 
ited by feedstock extractives, and the cost and types of feedstock pretreat- 
ment are strongly influenced by lignin content and composition (2), In the 
biomass power industry, slagging of boilers during direct combustion of 
biomass feedstocks is positively associated with feedstock ash content (3), 

Despite the importance of feedstock composition on each of the afore- 
mentioned processes or products, very little has been done genetically to 
improve feedstock quality in the biomass-based industries (4). Two rea- 
sons account for this lack of selection and improvement. First, conventional 
analytical methods often require months of process time per set of samples, 
with each analysis costing himdreds of dollars. Estimates of total compo- 
sition are often inaccurate because of incompatibility of assays, degrada- 
tion of component fractions during specific assays, or the need to use 
subtractive methods to calculate certain feedstock fractions. Secondly, 
genetic selection for improved feedstock quality has been hampered by 
inaccurate phenotypic data, lengthy assays, and long generation intervals 
for most perennial species. Because genetic selection generally relies on 
phenotypic characterization of hundreds of progeny replicated over time 
and space, the cost and the delays associated with estimates of feedstock 
quality from conventional wet chemistry prohibit such assessments as part 
of traditional breeding programs. 

Two new high-throughput analytical approaches, coupled with 
genetic mapping and marker-assisted selection methods (5), promise a 
means to overcome the aforementioned constraints. The two techniques 
are computer tomography X-ray densitometry (CT scan) and pyrolysis 
molecular beam mass spectrometry (pyMBMS). The objectives of this article 
are 1. to describe the general application of CT scan to segregating hybrid 
poplar progeny and pyMBMS to segregating loblolly pine progeny and 
2. to relate the results for loblolly pine to the existing genetic map through 
a quantitative trait loci (QTL) analysis. 

PyMBMS combined with multivariate analysis has been used previ- 
ously to estimate cell-wall chemical composition of different biological 
materials (6-8), These studies have applied pyrolysis mass spectrometry 
techniques to a variety of lignocellulosic materials including wood, herba- 
ceous fiber, and pulps, and have found a correlation between the pyrolysis 
results and the amount of cell-wall constituents determined by conven- 
tional techniques. Agblevor et al. (6) have demonstrated that the combina- 
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tion of pyMBMS and factor analysis can reliably estimate plant cell-wall 
components. Agblevor et al. (6) observed good correlation between total 
pentosans or total hexosans obtained using conventional wet chemistry 
methods and pyMBMS factor scores. However, the pyMBMS/lignin corre- 
lation was very weak, presumably because of degradation products that 
interfered with the Klason lignin analysis. For this study, we propose to use 
projection to latent structure analysis to regress conventional wet chemis- 
try analysis with pyMBMS spectrometric data (9), The advantage of the 
projection of latent structure data reduction approach is that the analysis 
can handle several responses (i.e., chemical/ feedstock components) simul- 
taneously, rather than regressing a single component with a single spectral 
peak or group of peaks. 

Materials and Methods 

Plant Materials 

A three-generation inbred hybrid poplar pedigree containing 375 seg- 
regating progeny was grown through age 4 by Boise Cascade Corp. in 
eastern Oregon. Populus trichocarpa was used as the maternal grandparent 
and Populus deltoides as the paternal grandparent. The segregating progeny 
were obtained by crossing a single full-sibling male with a single full-sib- 
ling female progeny from the initial parental mating. A single 5-mm incre- 
ment core representing both sides of a single tree, passing through or near 
the pith, was extracted 1.5 m above ground level from each member of the 
pedigree, including the parents and grandparents, and were shipped to 
Oak Ridge National Laboratory (Oak Ridge, TN) for further processing. 
Similarly, a three-generation, outcrossed loblolly pine {Pinus taeda L.) pedi- 
gree containing 172 segregating progeny was field-grown through ages 
11-13 by Weyerhaeuser Company in North Carolina and Oklahoma. 
Loblolly pine increment cores were sampled as described for hybrid poplar 
and were shipped to Centralia, WA for further processing. In addition, 
large sections of stem segments were removed from individuals known to 
have extreme specific gravity phenotypes (10) and used as a source of 
materials for conventional compositional analyses. Increment cores for both 
species were dried under vacuum at room temperature and stored at 0°C 
until needed for CT scan or pyMBMS. 

CT Scan: Hybrid Poplar 

Members of the Populus genus are represented by diffused porous 
species with moderate to low specific gravity and a lack of clearly defined 
annual growth rings (11). The CT scans were used to determine both annual 
ring boundaries and within-ring specific gravity. Each increment core was 
removed from storage, dried under vacuum at room temperature, and 
positioned on a wooden mounting block for X-ray scanning. For CT scan, 
each core was subjected to 420 kV, 3 mA coUimated, 0.474-mm thick X-ray 
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beam using a Scientific Measurement Systems Model B201 CAT scanning 
device (12). Each mounting block containing approximately 30 cores was 
translated and rotated 10°/min over an 18-min period in the X-ray beam in 
order to obtain a set of density functions. The density at any point within 
100 |im in the 0.474 mm beam was determined by simultaneously solving 
the set of density functions. The X-ray density data from the reconstructed 
three-dimensional digital image for each core was stored as an array of 
numbers representing local X-ray attenuation that was post-scan pro- 
cessed and converted to specific gravity by an algorithm developed from 
volumetric measurements and direct radiographic methods. Recon- 
structed specific gravity images from each core were used to delineate 
annual growth rings. 

PyMBMS: Loblolly Pine 

The fifth-yr growth ring from each loblolly pine increment core was 
removed, ground in a Wiley mill, and screened through a 0.2-mm mesh. 
These ground samples were then shipped at room temperature, next-day 
delivery to the National Renewable Energy Laboratory (Golden, CO) for 
pyMBMS analysis. Three 15 mg subsamples from each sampled genotype 
(i.e., from all progeny, parents, and grandparents) were removed from 
storage and used in the analysis. Each subsample was placed in a quartz 
boat and introduced into a pyrolysis reactor at 550°C under a helium flow 
rate of 5.0 L/min at 2VC, 0.1 MPa. Vacuum expansion created a molecular 
beam that was introduced into an Extrel Model TQMS C50 mass spectrom- 
eter (electron impact ionization voltage - 22.5 eV), which measured atomic 
mass between 15 and 300 Da in real time during the pyrolysis period (13). 
Total run time /subsample was 1 .5 min; all subsamples were analyzed over 
a consecutive period of time to avoid temporal variability in analytical 
equipment. Spectral data from the mass spectrometer were then analyzed 
using multivariate statistics. 

Genotypes of known extreme phenotypes were selected and used as 
external calibration standards during the pyMBMS analysis. That is, infor- 
mation on an extractive-free-basis for lignin, a cellulose, and hemicellulose 
components, percent extractives, and ash contents were estimated inde- 
pendently by laboratories at PAPRICAN (Pointe-Claire, Quebec, Canada) 
and Michigan Tech University (Houghton, MI) for each set of external cali- 
bration standards. Lignin content was determined by the Klason method, 
TAPPI T222 (14), Additional loblolly pine calibration standards were pro- 
vided by Weyerhaeuser Company (Tacoma, WA). Results from each labo- 
ratory were compared and inconsistencies resolved before initiating the 
pyMBMS analysis. Factor scores from the correlation matrix on normalized 
mass spectral data for each external calibration standard were correlated 
with known feedstock composition. Projection of latent structure models, 
in the form Y = + (principal component X), were developed for each 
feedstock component and used to predict feedstock composition for each 
tested genotype. 
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QTL Analysis: Loblolly Pine 

QTL were characterized using a simple regression method that uti- 
lizes the three-generation structure of the outbred loblolly pine pedigree 
(15,16), This analysis simultaneously uses genetic information from all 
markers of a linkage group to provide a robust test for the presence of 
genetic variation associated with each feedstock component. At regular 
genomic intervals, the probability of each progeny being one of the four 
possible genotypic classes, determined as a function of the grand parental 
genotypes, was calculated. The phenotypes were then regressed onto these 
genotypic probabilities using a multiple regression approach (17). Linear 
regression is relatively simple compared with a maximum likelihood 
approach and therefore allows for testing more complex (and potentially 
more realistic) genetic models (e.g., fitting one vs two QTLs/linkage group). 
On a genome-wide basis, QTLs were identified at either a "suggestive" at 
p < 0.01 (38) or significant at p < 0.005 level. QTLs were then positioned onto 
the loblolly pine consensus map (19). This consensus map was constructed 
from two independent three-generation pedigrees and contains 357 restric- 
tion fragment-length polymorphism, random amplified polymorphic 
DNA, and isozyme genetic markers. The linkage information from these 
two pedigrees was coalesced into 12 integrated linkage groups (and several 
smaller groups) representing approximately 85% of the loblolly pine 
genome. 

Results and Discussion 

Wood Property Phenotypes 
Wood-Specific Gravity 

Estimates of wood-specific gravity based on CT scan varied widely 
within samples, across armual growth rings (Fig. l),and among all sampled 
genotypes (Fig. 2). Increment cores extracted at 1.5 m typically contained 
three discrete growth rings, with wood-specific gravity increasing across 
the growing season within each ring (Fig. 1). The highest wood-specific 
gravity/yr occurred generally at the end of the growing season, whereas 
the lowest occurred at the beginning of each growing season. There was no 
overall trend among years, such that average yr 1 specific gravity was not 
consistently higher or lower than yr 2 specific gravity from one increment 
core to the next. Alternate sides of a single increment core displayed subtle 
differences in specific gravity, though generally maximum and minimum 
values were consistent within rings. However, specific gravity values 
occasionally fluctuated unexpectedly within annual growth rings. This 
fluctuation, visually observed as layers of cells with increased cell-wall 
thickness, may have been caused by changes in water or nutritional status 
of each tree or possibly owing to insect or disease infestations known to 
occur each growing season. 
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Year 3 



Year 3 



Fig. 1. Computer tomography X-ray densitometry results from a single increment 
core extracted at 1.5 m from a 4-yr-old hybrid poplar tree grown in eastern Oregon 
under fertigation. The increment core and CT scan represent both sides of a single tree 
starting with the pith in the middle of the image. 
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Fig. 2. Frequency of earlywood and latewood specific-gravity estimates based on 
computer tomography X-ray densitometry for the penultimate growth ring from 375 
4-yr-old hybrid poplar progeny grown in eastern Oregon. 



Mean "earlywood" specific gravity for the hybrid poplar pedigree 
was 0.29, ranging from 0.21-0.41 (Fig. 2), Note that true earlywood does not 
occur in a diffuse-porous species such as hybrid poplar. We referenced the 
mean increase in density that occurs throughout the growing season 
(as previously noted) as a point that differentiates earlywood from "late- 
wood." Mean latewood specific gravity was 0.40, ranging from 0.29-0.54. 
The phenotypic correlation between these two estimates was 0.77, suggest- 
ing that genotypes that have high earlywood specific gravity will likely 
have high latewood specific gravity. For the P. trichocarpa grandparent, 
estimates of earlywood and latewood specific gravity averaged 0.24 and 
0.34, respectively, vs 0.27 and 0.36, respectively, for the P. deltoides grand- 
parent. The parental-generation specific gravity estimates were consistently 
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Table 1 

Feedstock Composition as Determined 
by Conventional Analytical Chemistry for 6 of the 26 Loblolly Pine Samples 
Used as Calibrated External Standards During pyMBMS Analysis 

% Extractive-free oven dry weight 



Sample ID 


Wood type 


Lignin 


a Cellulose 


Hemicellulose 


Ash 


97-1 


Juvenile 


34.7 


35.8 


25.5 


0.34 




Mature 


32.3 


36.2 


24.4 


0.30 


97-2 


Juvenile 


32.1 


35.7 


25.8 


0.32 




Mature 


32.0 


36.1 


25.9 


0.30 


97-3 


Juvenile 


31.2 


37.4 


25.5 


0.31 




Mature 


30.2 


38.9 


25.7 


0.28 



higher than the grandparental generation estimates. All estimated specific 
gravity values (Fig. 2) were in the range typically reported for Populus 
species (3,11,20). 

Wood Composition 

Estimates of feedstock composition by conventional analytical 
approaches for the loblolly pine used as external calibration standards 
ranged from 26.7-34.7% for lignin, 35.7-48.0% for a cellulose, 24.4-25.9% 
for hemicellulose, and 0.28-0.34% for ash content (Table 1). Mature wood/ 
genotype tended to have lower lignin content, higher a cellulose content, 
and lower ash content as compared with juvenile wood from the same 
sample. Individual genotypes also varied in extreme values, with sample 
97-1 providing the highest estimate of lignin and sample 97-3 providing 
the highest cellulose values. Sample 97-2 tended to have average values for 
all tested components. Estimates for each component were determined 
separately without resorting to corrective subtraction, and thus, compo- 
nent values do not sum to 100. Based on these results, estimates for indi- 
vidual feedstock components were then used to calibrate the pyMBMS 
factor scores. 

PyMBMS projection of latent structure models were used to estimate 

lignin, a cellulose, glucan, specific gravity, extractives, and some individual 
hemicellulose sugar components, xylans, galactans, and mannans. The 
range of the chemical composition for each component, the correlation 
coefficient, and the root mean squared error of prediction (RMSEP) of the 
projection of latent structure models used to predict the loblolly pine chemi- 
cal compositions are shown in Table 2. The arabinan concentrations could 
not be estimated using pyMBMS methods owing to its low concentration 
and difficulty obtaining accurate arabinose estimates using conventional 
wet chemistry methods. The predictive model was derived from 26 samples 
rim in triplicate. The RMSEP is the average uncertainty that can be expected 
when predicting the concentrations for the unknown loblolly pine samples 
using this model. The results of the future predictions can be expected to be 
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Table 2 

Range of Phenotypic Standards, Correlation Coefficients, 
and Root Mean Squared Errors of Prediction (RMSEP) of the Loblolly Pine 
pyMBMS Projection of Latent Structure Regression Model 
Used to Predict Feedstock Composition 



Qiemical component 


Range Correlation coefficient 


RMSEP 


Lignin 


26.7-29.3 


0.92 


0.34 


a Cellulose 


41.5-48.0 


0.95 


0.89 


Extractives 


1.0-2.9 


0.91 


0.21 


Specific gravity 


0.38-0.57 


0.96 


0.017 


Glucans 


46.5-55.5 


0.85 


1.3 


Galactan 


2.6^.0 


0.83 


0.12 


Mannan 


8.8-11.9 


0.88 


0.47 


Xylan 


7.0-8.3 


0.86 


0.20 




Table 3 






Summary of pyMBMS Estimates of the Chemical Composition 




for a Segregating Pedigree 


of Loblolly Pine Sampled at the Fifth-Yr Growtii Ring 


Chemical component 


Phenotypic mean 


Range 


SD 


Lignin 


30.8 


28.7-33.1 


0.8 


a Cellulose 


36.4 


28.8-43.5 


2.7 


Extractives 


1.6 


0.8-2.1 


0.2 


Specific gravity 


0.27 


0.10-0.43 


0.06 


Galactan 


2.2 


1.1-3.0 


0.3 


Mannan 


7.5 


4.2-10.1 


1.0 


Xylan 


9.0 


8.0-10.1 


0.4 



within two RMSEP, The pyMBMS predictive model was reconciled using 
a full cross-validation scheme. 

Based on results of the projection of latent structure models, estimates 
of the overall chemical composition of the loblolly pine fifth-year growth 
ring for the different wood components are shown in Table 3. Figure 3 
shows the average triplicate estimates for a cellulose, lignin, and a hemicel- 
lulose component, mannan, determined using the pyMBMS analysis. The 
pyMBMS analysis clearly distinguishes between the wood sampled from 
the earlywood and latewood portion of the 5th-yr growth ring. The 
pyMBMS analysis correctly indicates a higher lignin content (Fig, 3B) and 
lower specific gravity (data not shown) in the earlywood portion of the ring 
with respect to the latewood portion of the ring consistent with previous 
reports (21,22). The analysis also determined that the concentrations of 
mannan (Fig. 3C) and galactose (data not shown) are lower in the early- 
wood than in the latewood, again, consistent with previous studies of soft- 
woods (23). The data also defines differences in the average chemical 
composition of the samples collected from trees grown in North Carolina 
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Fig. 3. PyMBMS predicted values for (A) a cellulose, (B) lignin, and (C) mannan . The 
earlywood and latewood samples from the North Carolina site are represented by • 
and O, respectively. The earlywood and latewood samples from the Oklahoma site are 
represented by x and - respectively. 



from those grown in Oklahoma (Fig. 3). The reason for these differences is not 
apparent. Finally, pyMBMS analysis showed that the grand parental and 
parental generations provided estimates of lignin content clustered within 
one-half standard deviation around the overall progeny mean (data not 
shown). Similar results were found for all other compositional components. 

QTL Analysis 

QTL were identified that were associated with each feedstock compo- 
nent examined in loblolly pine. From fifth-year growth ring pyMBMS data, 
two QTLs were identified for both earlywood and latewood lignin content, 
one mapping to the same position for both traits. These QTLs explain 
roughly 10% of the total variation for both lignin traits. Two additional 
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QTLs were associated with glucan content in earlywood, accounting for 
-^13% of the total variation. A single QTL was associated with both early- 
wood and latewood hemicellulose content and each accounted for approxi- 
mately 7% of the variation in hemicellulose. Interestingly, two positions on 
the loblolly pine genetic map, one on linkage group 8 and the other on 
linkage group 16, were associated with a large QTL cluster of several feed- 
stock components. The QTLs on linkage group 8 were related to galactans, 
mannans, xylans, and lignin contents from latewood. The QTLs on linkage 
group 16 were related to early wood and latewood galactans, mannans, 
xylans, and lignin contents. It may be possible that there are genes within 
the limits of the tested loblolly pine genome that influence overall cell-wall 
formation and are thus linked to individual cell-wall component estimates. 
If verified in a second, unrelated loblolly pine pedigree, the aforemen- 
tioned QTLs explain a sufficiently high enough portion of the total varia- 
tion in each feedstock component to merit a test of a marker-aided selection 
scheme for improved wood properties. 

In summary, CT X-ray densitometry and pyMBMS have proven to be 
reliable, accurate, and affordable methods for characterizing feedstock 
properties. Initial success with CT scan for hybrid poplar and pyMBMS for 
loblolly pine are currently being evaluated in reciprocal studies, i.e., CT 
scan for loblolly pine and pyMBMS for hybrid poplar. In the examined 
loblolly pine and hybrid poplar pedigrees, progeny from both species dis- 
played genetic variation in all examined feedstock components. QTL analy- 
sis suggested that there are positions within the genome of loblolly pine 
that are associated with feedstock composition and that many of the QTLs 
are tightly linked to one another. As such, these new techniques should 
facilitate genetic assessment of individual feedstock components, as well as 
genetic selection aimed at wood property trait improvement. 
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